Machine Translation of Noun Phrases from Arabic to English Using Transfer-Based Approach

نویسندگان

  • Omar Shirko
  • Nazlia Omar
  • Haslina Arshad
  • Mohammed Albared
چکیده

Problem statement: Any Arabic to English Machine Translation (MT) system should be capable of dealing with word order which Arabic exhibits. This poses a significant challenge to MT due to the vast number of ways to express the same sentence in Arabic. The ordering features are very important and should be carefully applied to ensure the generation of sentence in the target language. Because they apply to the target language, it should fulfill the specific requirement of this language. Mistakes in the MT output can be either the result of analysis problems at the source language level, or due to generation problem at target language level. Word order rules are crucial for the generation of sentences in the target language. They also serve as rules for the ordering of sentence constituents. These rules draw their information from the syntactic knowledge. The word order problem becomes more obvious when making machine translation between languages that have rich morphological variations. Approach: The main objective of this research is to develop a machine translation that translates Arabic noun phrases into English by using transfer-based approach. A system called NpaeRbmt has been developed in this research. Transfer-based machine translation is one instance of rulebased machine-translation approaches and is currently one of the most widely used methods of machine translation. The idea of transfer-based machine translation it is necessary to have an intermediate representation that captures the “meaning” of the original sentence in order to generate the correct translation. Using advantages of transfer-based machine translation such as analysis step, the Transfer-based becomes simpler as linguistic analysis goes deeper-as the representation of analysis step becomes more abstract. In fact, a major goal of MT research is to define a level of analysis which is so deep in which transfer-based machine translation is able to do. Results: The method was tested on 88 thesis titles and journals from the computer science domain. The accuracy of the result was 94.6%. These results proved the viability of this approach for distant languages. Conclusion: Based on the achieved results, we have managed to perform the syntactic reordering within an Arabic noun phrases to English translation task by using transfer-based machine translation and also achieved reasonable improvements in translation quality over related approach.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Rule Based Machine Translation of Noun Phrases from Punjabi to English

The paper presents automatic translation of noun phrases from Punjabi to English using transfer approach. The system has analysis, translation and synthesis component. The steps involved are pre processing, tagging, ambiguity resolution, translation and synthesis of words in target language. The accuracy is calculated for each step and the overall accuracy of the system is calculated to be abou...

متن کامل

Noun Phrase Reference in Japanese-to-English Machine Translation

This paper shows the necessity of distinguishing different referential uses of noun phrases in machine translation. We argue that differentiating between the generic, referential and ascriptive uses of noun phrases is the minimum necessary to generate articles and number correctly when translating from Japanese to English. Heuristics for determining these differences are proposed for a Japanese...

متن کامل

Use of Articles in Learning English as a Foreign Language: A Study of Iranian English Undergraduates

The significance of error analysis for the learner, the teacher and the researcher is now widely recognized. Earlier studies of error analysis concentrated on intersystematic comparison of the “native language” and the “target language” and drew the required data largely from intuitions and impressionistic observations. This study was conducted on the basis of the following observations: (1) to...

متن کامل

A Hybrid Approach Using Phrases and Rules for Hindi to English Machine Translation

The present work focuses on developing a hybrid approach for developing a machine translation (MT) scheme for automatic translation of Hindi sentences to English. Development of machine translation (MT) systems for Indian languages to English almost invariably suffers from the limited availability of linguistic resources. As a consequence, statistical, rule-based or example-based approaches hav...

متن کامل

Example-based Translation without Parallel Corpora: First experiments on a prototype

For the METIS-II project (IST, start: 10-2004 – end: 09-2007) we are working on an example-based machine translation system, making use of minimal resources and tools for both source and target language, i.e. making use of a target language corpus, but not of any parallel corpora. In the current paper, we present the results of the first experiments with our approach (CCL) within the METIS cons...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010